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ABSTRACT 

By comparing a collisionless cosmological N-body simulation (DM) to a smoothed 
particle hydrodynamics simulation (SPH) with the same initial conditions, we inves- 
tigate the correspondence between the dark matter subhalos produced by collisionless 
$> . dynamics and the galaxies produced by dissipative gas dynamics in a dark matter 

background. When galaxies in the SPH simulation fall into larger groups and become 
satellites, they retain local dark matter concentrations (SPH subhalos) whose mass is 
typically five times the galaxy baryonic mass (compared to the simulation's universal 
ratio ridm/^6 ^ 7.5). The more massive subhalos of the SPH simulation generally have 
corresponding subhalos of similar mass and spatial position in the DM simulation; at 
lower masses, there is still fairly good correspondence, but some DM subhalos are in 
different spatial positions and some have suffered tidal stripping or disruption. The 
halo occupation statistics of DM subhalos — the mean number of subhalos, pairs, and 
triples as a function of host halo mass — are very similar to those of SPH subhalos and 
SPH galaxies. The gravity of the dissipative baryon component amplifies the density 
contrast of subhalos in the SPH simulation, making them more resistant to tidal dis- 
ruption. Relative to SPH galaxies and SPH subhalos, the DM subhalo population is 
depleted in the densest regions of the most massive halos. The good agreement of halo 
occupation statistics between the DM subhalo and SPH galaxy populations leads to 
good agreement of their two-point correlation functions and higher order moments on 



large scales. The depletion of DM subhalos in dense regions depresses their clustering 
at R < l/i -1 Mpc. In these simulations, the "conversation" between dark matter and 
baryons is mostly one-way, with dark matter dynamics telling galaxies where to form 
and how to cluster, but the "back talk" of the baryons influences small scale clustering 
by enhancing the survival of substructure in the densest environments. 

Subject headings: galaxies: formation — large-scale structure of universe 



1. Introduction 

The idea that galaxies form by dissipation of the baryonic component within a collisionless dark 
matter halo has a long history (White Sz Rees 1978). The excellent agreement of the inflationary 
cold dark matter (CDM) model with a wide range of cosmological observations (e.g., Spergel et 
al. 2003; Seljak et al. 2005) puts this hypothesis on a firm theoretical footing. In the first stages 
of galaxy formation, dark matter does the talking: gravitational collapse produces dark matter 
potential wells that capture baryons, which radiate their energy and form dense objects at the 
halo centers. However, the subsequent details of the baryon-dark matter "conversation" are not 
so clear. Early N-body simulations showed that mergers of dark matter halos were followed by 
fairly rapid erasure of substructure, suggesting that dissipation in the baryonic component was 
crucial to the formation of groups and clusters with many distinct members, and that it was the 
gravity of the condensed baryons that allowed them to retain the central regions of their individual 
dark matter halos after falling into larger virialized systems. However, much higher resolution 
simulations in the late 1990s and thereafter showed that virialized halos retain a great deal of 
substructure (Ghigna et al. 1998; Klypin et al. 1999; Moore et al. 1999; Springel et al. 2001), and 
that the erasure of substructure in earlier simulations was largely an artifact of inadequate mass 
and/or force resolution. This result raises the possibility that baryon self-gravity is unimportant 
in producing groups and clusters, and that cooling and star formation merely produce "beacons" 
that mark the centers of dark matter structures that would be present even if baryons had no 
gravitational influence at all. 

In this paper, we investigate the dynamical interaction between baryons and dark matter 
using two numerical simulations of the same cosmological volume, one that incorporates both 
dark matter and a baryonic component modeled with smoothed particle hydrodynamics (SPH), 
and the other that starts from the same initial conditions but follows only the dark matter. We 
identify substructure in the dark matter distribution using a method that computes SPH-like density 
estimates within halos, then groups particles above saddle points in the density field. We are 
interested in the degree to which the presence of baryons alters the properties of substructure in 
the dark matter distribution and in the degree to which substructure in the purely gravitational 
simulation traces the galaxy population that forms in the hydrodynamic simulation. The latter 
issue is of practical as well of physical interest, since if the agreement is good one might be able to 



use N-body simulations in place of hydrodynamic simulations for galaxy clustering predictions. N- 
body simulations with the resolution needed to follow substructure are computationally expensive, 
but they are less expensive than full hydrodynamic simulations, thus allowing larger simulation 
volumes or wider searches of parameter space. We therefore pay particular attention to the halo 
occupation statistics of substructures vs. galaxies, since these in turn allow one to predict many 
different clustering statistics (Berlind & Weinberg 2002). 

There are several indirect indications that substructure in N-body simulations can provide 
good tracers of the galaxy population. First, Colin et al. (1999) and Kravtsov et al. (2004) show 
that the correlation functions of substructures in high resolution simulations agree quite well with 
observations, which in turn agree well with results from full hydrodynamic simulations (Cen & 
Ostriker 2000; Pearce et al. 2001; Yoshikawa et al. 2001; Weinberg et al. 2004). Conroy et al. 
(2006) show that the agreement with observations extends to a wide range of redshifts and galaxy 
space densities. Second, Berlind et al. (2003) show that there is remarkably good agreement between 
the halo occupation distribution found in our SPH simulations and those predicted by semi-analytic 
galaxy formation models of Cole et al. (2000). Since the treatment of cooling and star formation is 
quite different in the two methods, this agreement suggests that the halo occupation distribution 
is determined in large part by dark matter dynamics and halo merger histories, though even if true 
this does not guarantee that post-merger substructure will retain the information about the galaxy 
population. Third, Kravtsov et al. (2004) find that the halo occupation distribution of substructure 
in their high resolution N-body simulations is similar to that found in our SPH simulations by 
Berlind et al. (2003) and Zheng et al. (2005). While all of these results provide useful insight 
into the relative importance of dark matter dynamics and baryon dissipation in determining the 
spatial distribution of galaxies, this paper is the first, to our knowledge, to carry out the direct test 
of comparing galaxy populations in a hydrodynamic simulation of a cosmological volume to dark 
matter substructure in an N-body simulation with the same initial conditions. Nagai & Kravtsov 
(2005) have recently carried out a complementary experiment in simulations of galaxy clusters. 



2. Simulations 

We analyze two simulations with the same initial conditions, one run with dark matter only and 
one that incorporates a dissipative gas component and star formation. We hereafter refer to these 
as the DM and SPH simulations, respectively. The SPH simulation uses a parallel implementation 
(Dave, Dubinski, & Hernquist 1997) of TreeSPH (Hernquist & Katz 1989; Katz, Weinberg &; 
Hernquist 1996) to follow the evolution of 128 3 dark matter particles and 128 3 gas particles in 
a 22.222/i~ 1 Mpc comoving box, from z = 49 to z = 0. We adopt a ACDM cosmological model 
(inflationary cold dark matter with a cosmological constant) with parameters Q, m = 0.4, ^a = 0.6, 
n b = 0.02/i~ 2 , h = Fo/100 kms^Mpc- 1 = 0.65, n = 0.95, and a 8 = 0.8. Our choices of a 8 , 
Hq, n, and J^ are reasonably close to the recent estimates from cosmic microwave background 
anisotropies and large scale structure data (e.g., Spergel et al. 2003; Tegmark et al. 2004; Sanchez 



et al. 2006), while our value of £l m is somewhat high. The dark matter particle mass is 7.9 x 1O 8 M0, 
and the SPH particle mass is 1.05 x 1O 8 M . The gravitational force softening is a 5/i _1 comoving 
kpc cubic spline, roughly equivalent to a Plummer force softening of 3.5/i -1 comoving kpc. The 
DM simulation uses the same simulation code and the same initial positions and velocities of dark 
matter particles. It has the same numerical parameters, except that the dark matter particle mass 
is increased by a factor of S7 m /(O m — Q^) to 8.9 x 1O 8 M . 

Although the volume is much smaller, the mass resolution of the DM simulation is similar to 
that of the simulations used by Colin et al. (1999) and Kravtsov et al. (2004) to study the large 
scale clustering of subhalos (somewhat higher than their ACDMgo run with 256 3 particles in a 
60/i _1 Mpc box and somewhat lower than their ACDMso run with 512 3 particles in an 80/i _1 Mpc 
box), and it is slightly higher than that of the recent "Millennium Run" simulation (Springel et 
al. 2005; 2160 3 particles in a 500/i _1 Mpc box). It is not as high as the mass resolution in recent 
simulations focused on the substructure distribution in clusters (e.g., Diemand et al. 2004; Gao et 
al. 2004; Nagai & Kravtsov 2005). By chance, our simulation forms one halo that is unusually large 
for a 22.222/i _1 Mpc box given our cosmological parameters, with a mass of 4 x 10 M , allowing 
us to investigate substructure survival and baryonic influence in a Virgo-mass galaxy cluster (see 
Figure 3 below). This halo contains about 4.7 x 10 5 dark matter particles within its virial region. 
The next most massive halos have masses of ~ 3 x 1O 13 M . 

Details of the treatment of radiative cooling, star formation, and galaxy identification in the 
SPH simulation can be found in Katz, Weinberg & Hernquist (1996). In brief, gas can dissipate 
energy via Compton cooling and radiative cooling, computed assuming primordial composition and 
the photoionizing background field of Haardt & Madau (1996). Star formation occurs in regions that 
are Jeans unstable, above a threshold density (uh > 0.1cm" 3 ), and below a threshold temperature 
(T < 30, 000 K). We add the thermal energy from supernova feedback but it has relatively little 
impact, because it is usually deposited in a dense medium with a short cooling time. We identify 
galaxies using the Spline Kernel Interpolative DENMAX (SKID) 1 algorithm (Gelb & Bertschinger 
1994; Katz, Weinberg & Hernquist 1996), which identifies gravitationally bound clumps of stars 
and cold (T < 30, 000 K), dense (p g /p g > 1000) gas that are associated with a common density 
maximum. Comparisons among simulations with different resolution show that the locations and 
baryonic masses (stars plus cold, dense gas) of SKID galaxies are robust when the mass exceeds 
that of about 64 SPH particles, or 6.8 x 1O 9 M . For our lowest mass threshold sample in this 
paper, we take a slightly larger minimum mass of 7.1 x 1O 9 M . There are 1103 galaxies in the 
simulation volume above this mass, making the mean space density of this sample 0.1/i 3 Mpc . We 
also consider samples with higher minimum masses and lower mean space densities, as discussed in 
§4 below. 



1 We use the implementation of SKID by J. Stadel and T. Quinn, which is publicly available at 
http : //www-hpcc . astro . Washington . edu/tools/skid . html. 



3. Identification of Substructure 

Our method of identifying halos and substructures uses the publicly available code Adapt aHOP 
(Aubert, Pichon, & Colombi 2004), which derives from the HOP algorithm of Eisenstein & Hut 
(1998). It is very similar to the method used by Springel et al. (2001), though the implementation 
here is independent. The algorithm is detailed in Aubert, Pichon, & Colombi (2004), so we give 
only a brief summary of it here. We first calculate densities around each dark matter particle 
using an SPH-like kernel estimator, with a cubic spline kernel containing 32 neighbors (just as in 
the SPH simulation itself). During that operation, we store as well the iVh op nearest neighbors of 
each particle, with iVh op = 16 as advocated by Eisenstein & Hut (1998). Then, we partition the 
ensemble of particles into "peak patches" . A peak patch is a set of particles with the same local 
density maximum, identified by connecting each particle to its densest neighbour among its iVhop 
closest to track the local density gradient. The connectivity between the peak patches is dictated 
by the saddle points in the density field. These points are found by locating local maxima in the 
boundaries between peak patches. In fact, for each pair of peak patches connected through at 
least one saddle point, one needs only the saddle point with highest density. Then one is ready 
to construct an ensemble of trees, the haloes, and the branches of the trees and their leaves, the 
substructures, each leaf corresponding to a unique local maximum, or peak patch. 

In this representation, a halo is defined as a connected group of particles with overdensity 
p/p > 81, as advocated by Eisenstein & Hut (1998); a leaf is defined as a subset of particles 
in a peak-patch with SPH density larger than p s , where p s is the density of the highest saddle 
point connecting this peak patch to a neighbouring one. In order to select substructures which are 
statistically significant compared to Poisson noise, we impose a 4a level threshold, 



\Pl substructure -> Ps 



1 + 



(1) 



where N is the number of particles in this substructure (with SPH density above p s ) and (p) substructure 
is the average SPH density in this substructure. A substructure not following this constraint is 
absorbed by the neighboring substructure connected to it through the highest saddle point. This 
operation is performed recursively until condition (1) is fullfilled. Finally, note that most of haloes 
do not have any substructure, or equivalently, only one, the halo itself. In our representation, and 
in what follows, such a halo is considered simultaneously as a halo and a substructure. A higher 
resolution simulation would presumably reveal substructures in these low mass halos, but they 
would be below the mass threshold of the resolved galaxy populations that we consider below. 

To understand the procedure followed in AdaptaHOP, it is instructive to look at the 1-dimensional 
analogue shown in Figure 1. The densities at the particle locations define a 1-dimensional density 
field. Halos are connected regions above the overdensity threshold of 81 (horizontal dashed line), 
and there are three such halos in Figure 1 (horizontal thick lines). Saddle-points in 3-d correspond 
to local minima in this 1-d example, and the three halos in Figure 1 contain three substructures, two 
substructures, and no substructure, respectively. There is necessarily one and only one maximum 




Fig. 1. — Illustration of AdaptaHOP, in one dimension. In this example, saddle points are local 
minima. The haloes are connected structures above the density threshold. Three of them are 
detected, HI, H2 and H3, as defined by the thick horizontal lines. Each elementary substructure 
(a leaf) is associated to a local maximum. Therefore, HI, H2 and H3 contain respectively 3, 2 and 
1 leaves. A given substructure can be connected to other ones by saddle points. For instance S2 is 
connected to SI and S3 through 2 saddle points. To compute the extension of each substructure, we 
take only the points which have density larger than p s , where p s is the maximum value measured 
at the saddle points, as defined by the horizontal dotted lines. Note that halo H3 is its own 
substructure, S6. Its boundary is defined by the halo density threshold p/p = 81. 



between each pair of minima, and this maximum is identified as the location of the substructure. 
The mass of the substructure is the mass above the density threshold of the higher minimum, as 
indicated by the horizontal dotted lines in Figure 1. In most cases, this is a reasonable way of as- 
signing mass, but it tends to underestimate the mass of a large central object with a much smaller 
satellite. Note that the sum of the masses of the substructures in a halo is generally smaller than 
the halo mass itself, unless the halo does not contain any substructure. In the present paper, we 
use only the trees (haloes) and leaves (individual subhalos), but the substructure finder also builds 
the entire set of branches using saddle points as connectors. 



4. Results 

4.1. Formation of a Galaxy Group 

Figure 2 illustrates the formation history of a representative galaxy group. This group occupies 
the fourth most massive halo in the simulation, with a mass of 3.1 x 1O 13 M0. The left and middle 
columns show the dark matter particle distributions in the DM and SPH simulations, respectively, 
with particles color-coded according to local density estimated with the 32-particle spline kernel 
used in the substructure identification. Specifically, the lower panels show the dark matter particle 
distributions in the central 0.5/i _1 Mpc of this halo at z = 0; the full extent of the region within 
the p/p > 81 surface is about a factor of two larger. The top three rows show the distributions of 
the same particles at z = 2, 1, and 0.5, respectively (a small number of particles are missed off the 
bottom of these plots). At z = 2, these particles are spread over a region roughly 5/i -1 Mpc across 
(comoving), and many of them are clumped into small halos aligned along filamentary structures. 
Between z = 2 and z = 0.5, the small halos merge into larger halos, the underlying filamentary 
network becomes less evident, and the whole comoving volume shrinks slightly in size. 

The right hand column of Figure 2 shows the distribution of star particles, shown as green 
dots, and gas particles, color coded by temperature, from the SPH simulation. The temperature 
color scale runs logarithmically from T ~ 5 x 10 3 K (blue) to T ~ 10 7 K (yellow). Note that high 
density can make the stellar clumps visually inconspicuous even when they are fairly massive; we 

Fig. 2. — (Following page.) Formation of a galaxy group, in a halo of mass 3.1 x 10 13 M Q . The lower 
left panel shows the dark matter particles in the central 0.5/i _1 Mpc of this halo at z = 0, in the DM 
simulation. Panels above it show the positions of the same particles at z = 3, 1, and 0.5 (top to 
bottom). The panel size is different at each redshift; the white bar is always 0.5/i _1 Mpc comoving. 
Central panels show the corresponding dark matter particle distributions in the SPH simulation. 
Right hand panels show the distributions of gas and star particles at the same redshifts. Green 
points (plotted larger for visibility at z > 0) show star particles. Gas particles are color coded by 
temperature on a scale running from ~ 5 x 10 3 K (blue) to ~ 5 x 10 6 K (red). 
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have used larger dots for star particles at z > 0.5 so that the clumps remain visible. By z = 0, all 
of the gas that is not in galaxies has been heated to T ~ 5 x 10 6 K, but at earlier times much of 
the gas in filaments or the diffuse medium between them is cooler than 10 5 K. The high redshift 
panels also show clumps of dense gas that has cooled to T ~ 10 4 K but has not yet formed stars. 
The absence of stars in these clumps is primarily a numerical resolution effect --in tests with 
simulations of varying resolution, we find that the SPH code underestimates star formation rates 
in objects with less than ~ 200 particles. 

At z > 1, the dark matter distributions in the simulations with and without gas are nearly 
indistinguishable, and even at z = 0.5, the differences are small. Gas condensation and star for- 
mation occur at the centers of the larger dark matter concentrations. At z = 0, all of the larger 
galaxies are associated with a visually identifiable dark matter substructure. The largest substruc- 
tures are at similar locations in the DM and SPH simulations. Smaller substructures cannot be 
visually matched one-to-one between the two simulations. The gravity of the dense baryon clumps 
increases the density of subhalos in the SPH simulation, making them more visually prominent. 
However, we will show below that the number and mass distribution of subhalos is actually similar 
in the SPH and DM simulations, and that the level of one-to-one subhalo correspondence is more 
than meets the eye. 



4.2. Galaxy and Subhalo Populations 

Figure 3 shows the largest halo in the simulation, with a mass of 4.0 x 10 14 M Q . The left hand 
panels show the dark matter particle distribution in the SPH simulation (top) and DM simulation 
(bottom), with particles color-coded according to their local density, again estimated with the 
32-particle spline-kernel smoothing used in the substructure identification. The halo contains two 
major subcomponents within its p/p = 81 density boundary. One can see a large number of 
local density maxima in both the SPH and DM simulations. There is good correspondence in the 
positions of the larger density maxima, while the smaller density peaks are similar in number but 
do not correspond in position. One can also see that these local density peaks are systematically 
suppressed in the DM simulation near the centers of the two large clumps. A higher resolution 
simulation might preserve a larger degree of substructure in these innermost regions, but the SPH 
and DM simulations have the same mass and force resolution, so the differential effect of including 
the dissipative baryon component should be correct, at least qualitatively. Several groups have 
recently carried out detailed convergence tests for cluster substructure in N-body simulations and 
concluded that the suppression of substructure in cluster cores is for the most part a real effect 
of tidal stripping and disruption rather than a numerical artifact (Diemand et al. 2004; Gao et al. 
2004). 

The right hand panels of Figure 3 illustrate the application of AdaptaHOP to this halo. Black 
points show particles that have been assigned to a subhalo, and cyan points show particles that 
are assigned to the parent halo but not to a subhalo. One can see a good correspondence between 
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Fig. 3. — Dark matter and dark matter substructure for the largest halo in the simulation, with a 
mass of 4.0 x 1O 14 M at z = 0. Panels are 4/i _1 Mpc across. Left hand panels show the halo dark 
matter particles in the SPH simulation (top) and DM simulation (bottom). Particles are coded 
by local density, estimated using a spline kernel enclosing 32 neighbors. In the right hand panels, 
black points show particles that are members of subhalos identified by AdaptaHDP. Cyan points 
show particles that are not connected to one of these substructures but are above the overdensity 
81 threshold. 
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the positions of the largest identified subhalos in the two simulations, as expected from the density 
maps. It is hard to infer the mass of substructures from these plots because of saturation; denser 
subhalos often appear less massive because they are more compact. 

Figure 4 compares the subhalo populations of the two simulations to the galaxy populations 
of the SPH simulation, this time for the four most massive halos. The bottom row shows the same 
halo whose formation history is illustrated in Figure 2. In the left hand panels, each galaxy is 
represented by a circle whose area is proportional to its baryonic mass (stars plus cold gas). The 
smallest circles correspond to a mass of 7.1 x 10 9 Mq, slightly above our mass resolution limit. The 
middle panels show subhalos of the SPH simulation represented in the same fashion, except that all 
subhalo masses have been lowered by a factor of five (and the same point size scaling and minimum 
mass threshold have then been applied). Right hand panels show the DM simulation's subhalo 
population, with the same factor of five mass scaling. The limiting subhalo mass corresponds to 40 
dark matter particles. 2 

There is good agreement between the locations and scaled masses of the SPH galaxies and the 
DM subhalos in the SPH simulation, and this agreement holds almost all the way to the resolution 
limit except in the most massive halo. More remarkably, there is good agreement between the 
locations and scaled masses of subhalos in the DM simulation and the galaxies (and subhalos) in 
the SPH simulation. There are some positional differences, and these become larger for lower mass 
subhalos, so at low masses it is difficult to tell whether there is still a one-to-one correspondence 
between subhalos in the two simulations. 

Figures 5 and 6 show that the correspondence of subhalos remains good, though not perfect, 
even at fairly low masses. Figure 5 focuses on the fourth most massive halo, also shown in the 
bottom row of Figure 4. In the upper left panel, small dots show the particles in six of the seven 
largest SPH subhalos, with a different color for each; we have omitted the largest subhalo to 
preserve visual clarity. Dots in the upper right panel represent the corresponding particles in the 
DM simulation (i.e., those that had the same positions in the initial conditions), with the same color 
coding. In every case, there is a clear identification between an SPH subhalo and a DM subhalo, but 
particles in the DM simulation are more spread out. Since the subhalos are identified in the SPH 
simulation, it is virtually inevitable that the particle distributions will be more compact there, but 
the blue-dot halo, in particular, shows signs of substantial tidal stripping in the DM simulation. 
Of particular interest in this comparison is the magenta-dot subhalo, which is at a significantly 
different location in the DM simulation but has much the same particle content, though it, too, 
shows signs of some tidal stripping. 

The lower panels of Figure 5 show similar results for six of the lower mass subhalos, ranging 
from 50 particles (black points) to 156 particles (red points). The black-, red-, and magenta- 



2 In this and all subsequent figures, we have multiplied the particle masses in the SPH simulation by fi m /(fi„ 
fib) = 1.134, so that SPH and DM subhalos with the same number of particles are assigned the same mass. 
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Fig. 4. — Galaxy and subhalo content of the four most massive halos. Left hand panels show the 
SPH galaxies, middle panels show the dark matter subhalos in the SPH simulation, and right panels 
show the subhalos in the DM simulation. Each galaxy or subhalo is represented by a circle with 
an area proportional to its mass; the masses of the subhalos have been multiplied by 0.2 but are 
otherwise on the same scale as the galaxies. The upper panels are 4/i _1 Mpc on a side, while the 
other panels are 2/i _1 Mpc on a side. 
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Fig. 5. — Correspondence of particles in subhalos of the fourth largest halo of the SPH simulation 
(left panels) and the DM simulation (right panels). Circles show subhalos of the two simulations 
as in the bottom panels of Fig. 4. Colored dots show particles associated with particular subhalos 
in the SPH simulation (left) and the locations of the corresponding particles in the DM simulation 
(right). In the upper panels, the six most massive subhalos (after the most massive, central subhalo) 
are marked, while the lower panels show lower mass subhalos that illustrate a range of behaviors. 
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Fig. 6. — Masses of subhalos in the DM simulation vs. masses of the corresponding halos in the 
SPH simulation. Different panels show different host halo mass ranges as indicated. A DM subhalo 
is identified as corresponding to an SPH subhalo if it contains more than 1/4 of the same particles 
and is not more than a factor of four more massive; the latter condition eliminates spurious matches 
of tidally stripped particles to much larger subhalos. The lower right panel shows the fraction of 
SPH halos that have no matching DM halo by these criteria. Vertical arrows in each panel mark 
the masses corresponding to 32, 64, and 128 particles. 
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dot subhalos have maintained their identity in the DM simulation, though the latter two have 
shifted positions noticeably. The cyan-dot halo retains a core of the same particles at about the 
same location in the DM simulation, but many of its particles have been tidally stripped and are 
spread throughout the core of the halo. Finally, the blue-dot and green-dot halos appear to have 
been tidally disrupted, with their particle contents widely dispersed through the halo in the DM 
simulation. 

Figure 6 shows quantitative results for the full halo population. For each SPH subhalo, we 
match a DM subhalo if it contains more than 1/4 of the same particles. We suppress matches in 
which the DM subhalo is more than four times the mass of the SPH subhalo, since these cases arise 
when tidally stripped particles are attached to a different subhalo (typically the central one). The 
first three panels compare the masses of DM subhalos to masses of the matched SPH subhalos, 
in the 4 x 10 14 M o halo (upper left), the halos with 1O 13 M < M < 1O 14 M (upper right), and 
10 12 Mq < M < 10 13 Mq (lower left). The agreement in subhalo masses is generally very good, with 
somewhat larger scatter for the least massive subhalos in the most massive halo. The lower right 
panel shows the fraction of subhalos that are unmatched as a function of subhalo mass. For the 
most massive halo, this fraction rises to 50% for M su b ~ 10 11 M Q , corresponding to ~ 128 particles 
(rightmost vertical arrow). For the lower mass halos, the matched fraction is still ~ 75% at this 
M sub , and it does not fall to 50% until ~ 64 particles (10 13 M Q < M < 10 14 M Q ) or ~ 25 particles 
(10 12 M Q < M < 10 13 Mq). It is not clear whether the "missing" subhalo matches are primarily a 
consequence of physical disruption at low masses or numerical artifacts at low particle number, but 
our results suggest that one should be cautious in interpreting subhalo mass functions in cluster 
simulations below ~ 100 particles or ~ 1O 11 M0. 

Returning to the top row of Figure 4, one can see in the densest regions of the largest halo a 
slight paucity of subhalos (relative to the galaxies) in the SPH simulation and a more substantial 
lack of subhalos in the DM simulation. Figure 7 compares the radial number density profiles of 
galaxies and subhalos around the central galaxy of the main component of this halo. Here we 
use a mass threshold of 7.1 x 10 9 M Q for the galaxies and a mass threshold larger by f2dm/^6 f° r 
the subhalos, where f^dm = ^m — ^6- Since we normalize the profiles to the mean density of 
the corresponding galaxy or subhalo population in the entire simulation volume, the qualitative 
appearance of Figure 7 is not sensitive to the choice of mass thresholds. The radial profile of the SPH 
subhalos is only slightly depressed relative to the galaxies, but the DM subhalos are substantially 
depleted within R ~ 0.2/i _1 Mpc. This result agrees with other recent studies of subhalo depletion 
in cluster mass halos (Diemand et al. 2004; Gao et al. 2004; Nagai & Kravtsov 2005), although the 
impact of the baryons on the survival of subhalos appears somewhat stronger here than in Nagai 
& Kravtsov (2005). 

Can one use substructure in a high-resolution N-body simulation to identify the galaxy pop- 
ulation that would be found in a full hydrodynamic simulation? Here we will focus on galaxy 
populations defined by thresholds in baryonic mass (stars plus cold, dense gas), which should be 
similar but not identical to populations defined by thresholds in luminosity. Two subtleties then 
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Fig. 7. — Radial number density profile of galaxies (solid line) and subhalos (dotted and dashed 
lines for the SPH and DM simulations, respectively) around the most massive galaxy of the most 
massive halo. Galaxies are selected above a mass threshold Mg <m \ n = 7.1 x 1O 9 M0, and subhalos 
are selected above a mass threshold M 9)m i n J7dm/^6- Densities are normalized to the mean density 
of galaxies or subhalos above the mass threshold in the full simulation volume. Curves stop when 
the only interior galaxy or subhalo is the central object. 
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arise in trying to answer the question. First, since halos retain an enormous amount of substructure 
if one goes to sufficiently small mass scales (e.g., Moore et al. 1999; Springel et al. 2001), it is virtu- 
ally guaranteed that one can find "enough" substructures in each halo to correspond to the number 
of galaxies above a moderate or high baryonic mass threshold. However, the N-body/substructure 
approach has no predictive power unless one knows which (or at least how many) substructures to 
pick in each halo, so there must be some threshold in substructure mass for a given baryonic mass 
threshold. 

The second subtlety arises because a halo that falls into a larger halo (and thus becomes a 
substructure) starts to lose mass via tidal stripping. Since this process does not remove mass from 
the halo's central galaxy (at least until it is close to total disruption), the mass of a substructure 
containing a galaxy in a group or cluster will generally be smaller than the mass of an isolated 
halo that contains a similar galaxy in the "field." It is therefore unlikely that a simple global 
threshold in substructure mass is likely to work for identifying a galaxy population — if one picks 
the threshold based on the lowest mass halos that host such galaxies in the field, then there will 
be too few "galaxies" found in rich groups and clusters. One way to tackle this problem is to use 
circular velocity thresholds instead of mass thresholds, in the hope that the circular velocity remains 
a nearly monotonic function of the central galaxy's baryonic mass even if tidal stripping removes 
the outer parts of the halo in which the galaxy formed. This approach suffers from ambiguity in 
the choice of where to define the circular velocity, especially since tidal stripping alters the density 
profiles of substructures, making them systematically different from isolated halos (Stoehr et al. 
2002). Furthermore, recent N-body studies indicate that the circular velocities of subhalos do in 
fact decline as they are tidally stripped Nagai & Kravtsov (2005), so using circular velocity instead 
of subhalo mass only partly compensates for stripping effects. 

Here we have adopted a simple approach that seems to work surprisingly well. To identify a 
substructure population that corresponds to the galaxy population above mass threshold Mb <m [ n , 
we first apply a global mass threshold to the halo (not substructure) population. In the SPH 
simulation, we find the halo mass Mh tUl [ n at which 50% of halos (in a sample of 20 centered on 
that mass) contain a galaxy above -Mfc im ; n , and we eliminate halos with M < Mh tUl [ n . We use the 
same threshold mass in the DM simulation, except that we multiply M^ jm i n by f2 m /(f2 m — fib) to 
account for the fact that baryons are not counted when computing the mass of the SPH halos. In 
the left hand panel of Figure 8, crosses show the minimum halo mass (in the SPH simulation) as 
a function of the galaxy mass threshold. These lie close to the line 1.35(f2 m — £1{,)/Qb, indicating 
that these minimum mass halos have typically put about 75% of their available baryons into the 
central SPH galaxy. 

After eliminating halos below the mass threshold, we now apply a global mass threshold M s ^ m i n 
to the subhalo population, choosing its value so that the total number of subhalos in the simulation 
is equal to the total number of galaxies above the mass threshold. These subhalos are the "galaxy" 
population predicted by the subhalo method. The M s ^ m \ n threshold is always lower than Mh, m m 
because of the tidal stripping effects discussed above. Note also that a halo that passes the M^ m j n 
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Fig. 8. — (Left) Mass thresholds for identifying dark matter subhalos with SPH galaxies. Crosses 
show the halo mass M at which {N)m = 0.5 for galaxies above the threshold baryonic mass Mb im i n . 
When matching a subhalo population to a galaxy population, we only consider halos above this 
threshold mass. Triangles and circles show the subhalo mass threshold that is then required to 
match the space density of the galaxy population, in the SPH and DM simulations, respectively. 
For comparison, the dotted line shows 1.35 x ildm/^6 x -^ft,min> and the solid line shows 3.5M^ m i n . 
(Right) The average mass of galaxies (squares) above Mfc jmin compared to the average mass of dark 
matter subhalos (triangles and circles for the SPH and DM simulations, respectively) above the 
mass threshold indicated in the left panel. The solid line shows the mean galaxy mass multiplied 
by five. 
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threshold may not, in the end, contain a galaxy, since the mass of its largest substructure may be 
lower than M s>rn i n . These "unoccupied" halos (which are, in practice, fairly rare) should represent 
cases where the halo contains two or more galaxies below the M^ m [ n threshold instead of one (or 
more) above it. 

Triangles and circles in the left panel of Figure 8 show the threshold values required to match 
the galaxy and subhalo populations, for the SPH and DM simulations, respectively. The threshold 
for the DM simulation is approximately 3.5 times the galaxy baryonic mass, at all masses. The 
threshold for the SPH simulation is higher than this at low galaxy masses, suggesting that in this 
regime the baryonic clumps help to reduce tidal mass loss from their local subhalos. The right 
panel of Figure 8 shows the average masses of substructures above these thresholds. To a good 
approximation, the mean mass of subhalos above a threshold is simply five times the mean mass of 
galaxies above the corresponding threshold. We will show below that the halo occupation statistics 
and spatial clustering of the subhalo populations identified in this way are similar to those of 
the corresponding SPH galaxy populations. However, we first investigate the extent to which the 
presence of dissipative baryons in the SPH simulations alters the properties of the dark matter 
subhalos themselves. 



4.3. The Influence of Baryons on Halo Substructure 

Figure 9a compares the differential baryonic mass function of SPH galaxies to the differential 
mass functions of subhalos in the SPH and DM simulations. The two subhalo mass functions are 
similar, showing that the dissipative baryon component has only a small impact on this global 
measure of the subhalo population. The subhalo mass functions are similar in form to the galaxy 
mass function, shifted in mass scale by a factor of five, with a somewhat larger shift at low masses. 

The remaining panels show the subhalo mass function in the largest halo (Fig. 9b) and in 
halo mass ranges log M^/Mq = 13 — 14 (Fig. 9c) and 10.8 — 13 (Fig. 9d). Here we have not 
imposed any explicit threshold on the subhalo masses. The turnover of the subhalo mass functions 
at low masses is an artifact of the simulations' finite mass resolution, but since this resolution is 
the same in each case, we can use the differential comparison to investigate the influence of the 
baryon component on the survival of dark matter subhalos. These effects are generally mild, but 
they have the expected sign. In particular, the ability of dense baryon clumps to retain surrounding 
dark matter concentrations boosts the number of low mass subhalos in the largest halos, by up to 
a factor of two. 

Figure 10 examines the influence of baryons on the internal structure of subhalos. We define 
a simple measure of subhalo "concentration" by measuring a density for each particle using an 
SPH-like smoothing kernel containing 32 neighbors and taking the ratio of the highest density in 
the subhalo to the mean density of all particles in the subhalo. The strong trend of this concen- 
tration measure with subhalo mass is probably an effect of mass resolution — one can trace the 
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Fig. 9. — Differential mass functions, in number per (/i _1 Mpc) 3 per decade, of galaxies and sub- 
halos. The upper left panel shows results for the full simulation, with filled circles showing the 
baryonic mass function of SPH galaxies and squares and crosses showing the dark matter mass 
function of subhalos in the SPH and DM simulations, respectively. Here subhalos have been se- 
lected according to the two-stage thresholding procedure described in the text. The solid line 
shows the galaxy mass function shifted right by a factor of five. Note that bins are evenly spaced 
in logM but that some bins at high mass contain no objects. Remaining panels show the subhalo 
mass functions in bins of halo mass, as indicated. The lower limit of the 10.8 < logM^ < 13 bin 
corresponds to a mass of 64 dark matter particles. For these three panels we include all identified 
substructures, not just those selected by the thresholding procedure. 
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Fig. 10. — Average peak-to-mean density ratio of dark matter subhalos in the SPH simulation (solid 
line) and DM simulation (dotted line) . Densities are computed for each particle in the subhalo using 
an SPH-like smoothing kernel containing 32 neighbors, and the quantity shown is the ratio of the 
maximum density in the subhalo to the mean density of the subhalo, averaged in bins of subhalo 
mass. The trend with subhalo mass is probably a mass resolution effect, but the difference between 
the SPH and DM simulations shows the concentrating effect of cooled baryonic components within 
subhalos. 
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density profile of more massive subhalos into smaller radii — but we can again make a differential 
comparison. In contrast to Figure 9, the influence of baryons on subhalo concentrations is strong, 
with differences of a factor of several by this measure. This result is in qualitative agreement with 
the expectation that dissipative baryons lead to adiabatic contraction of their surrounding dark 
matter concentrations (Blumenthal et al. 1986), though a quantitative investigation of these effects 
is better carried out with higher resolution simulations of individual halos (e.g., Gnedin et al. 2004). 



4.4. Halo Occupation Statistics 

On scales that are large compared to the virial diameters of the largest halos, the clustering 
of the galaxy population is determined by the number of galaxies in each halo, regardless of their 
internal distribution within halos. If the halo occupation statistics of a population of galaxies 
and a population of subhalos are identical, then they will yield the same large scale results for all 
measures of clustering. Here we compare halo occupation statistics for mass-thresholded samples of 
SPH galaxies to those of matched subhalo samples defined by the two-stage thresholding procedure 
described in §4.2. 

Points in Figure 11 show the number of galaxies in each of the 30 most massive halos of the SPH 
simulation. The four panels correspond to four different baryonic mass thresholds, and the mean 
space densities of galaxies above these thresholds are 0.1, 0.05, 0.02, and 0.01/i 3 Mpc , respectively. 
Dotted lines show the number of subhalos in each of these halos above the mass thresholds indicated 
by the triangles in Figure 8a; by construction, the mean space density of these subhalos matches 
that of the corresponding galaxy population. Dashed lines show the number of subhalos in the 
same halos of the DM simulation, with mass thresholds shown by the circles in Figure 8a. The 
agreement between the number of galaxies and the number of subhalos in the matched population 
is extraordinarily good for both the SPH and DM simulations, at all four space densities. This 
agreement holds for the most massive halo despite the visible paucity of subhalos in the densest 
regions of this halo (Fig. 4). The subhalo mass threshold, chosen to give agreement with the global 
number density of galaxies in the simulation, has the effect of replacing these missing subhalos in 
the halo core with slightly less massive subhalos in the outskirts. 

Figure 12 examines the distribution of subhalo numbers in halos with N g = 1, 2, or 3 SPH 
galaxies (top to bottom), for samples with space densities of 0.1/i 3 Mpc (left hand panels) or 
0.02/i 3 Mpc (right hand panels). In each panel, solid histograms show the distribution of subhalo 
numbers in the SPH simulation, and dotted histograms show the distribution in the DM simulation. 
If the subhalo and galaxy populations agreed perfectly, these histograms would be P(N) = 1 for 
N = N g and P(N) = for all other N. The agreement is generally very good but not perfect. For 
example, of the 15 halos with N g = 2 for n = 0.02/i 3 Mpc , nine have two subhalos in the DM 
simulation, four have one subhalo, one has three, and one has four. The agreement for the SPH 
subhalos is usually better than for the DM subhalos, but not dramatically so. 
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Fig. 11. — Halo occupations of the 30 most massive halos in the SPH and DM simulations. Points 
connected by the solid black line represent galaxies, while the dotted red and dashed green lines 
represent the matched subhalo populations in the SPH and DM simulations, respectively. From top 
to bottom, the four panels correspond to increasing galaxy baryonic mass thresholds and decreasing 
population space densities, as indicated in each panel. 
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Fig. 12. — Distribution of subhalo occupations in halos with low galaxy occupation number. Top, 
middle, and bottom panels show results for halos that contain 1, 2, and 3 SPH galaxies, respectively, 
in the n = 0.1/i 3 Mpc sample (left) and the n = 0.02/i 3 Mpc sample (right). The corresponding 
halo masses can be inferred from Fig. 13. Solid histograms show the distribution of the number of 
subhalos above the matching mass threshold in the SPH simulation, and dotted histograms show 
the same distribution for the DM simulation. Perfect agreement would correspond to P(N) = 1 for 
N = N g and P(N) = otherwise, in which case the histograms would intersect the filled circles. 
Crosses show P(N) for a "control" sample in which the galaxy occupation of each halo is replaced 
by the galaxy occupation of the next halo in a list rank-ordered by mass. 
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The detailed agreement seen in the individual halo plots of Figure 4 and in the number counts 
of Figure 11 suggests that the subhalo counts are not just reproducing the mean number of galaxies 
at a given halo mass but are, to some degree, tracking the variation in galaxy number from halo to 
halo at each mass. To investigate this issue in the low occupancy regime, we rank-ordered the halo 
list by mass, then replaced each halo's galaxy occupation N g with the occupation TV' of the next 
halo on the list. Crosses in Figure 12 show P(N') — in essence, they show the effect of randomly 
replacing each halo's galaxy population with that of another halo of nearly identical mass. The 
subhalo method is doing "better than random" if the histogram lies above the cross in the N = N g 
bin and below the crosses in the other bins. For N g = 2, this is clearly the case; one can predict 
a halo's galaxy number more accurately using its SPH or DM subhalos than by using the galaxy 
number of another halo of similar mass. For N g = 1 and N g = 3, on the other hand, subhalos do at 
most slightly better than random assignment. Note, however, that the absolute level of agreement 
for N g = 1 is high, and that fewer than 10% of halos that have two subhalos above threshold 
contain only a single galaxy. 

Figure 13 compares the mean occupation functions {N)m of the two galaxy samples to those of 
the corresponding subhalo samples. The locations of the lower cutoffs match by construction, since 
we eliminate halos below the mass at which {N)m = 0.5 before choosing the subhalo population. 
In addition, matching the global space density of the subhalo population to that of the galaxy 
population forces agreement in the values of L dn/dM{N)MdM, where dn/dM is the halo mass 
function. However, it is clear that the agreement between the galaxy and subhalo occupation 
functions is far better than these constraints alone would require. The excellent match in halo 
occupations seen here and in Figure 11 implies that the large scale clustering of a mass-thresholded 
galaxy population and a properly matched subhalo population should be very similar in all respects. 

On small scales, the two-point correlation function is dominated by pairs of galaxies that reside 
in the same halo. In this one-halo regime, the important quantity for determining the two-point 
correlation function is {N(N — 1))m, the mean number of pairs per halo (Seljak 2000). Similarly, 
the three-point correlation function on small scales depends on the mean number of triples per 
halo (N(N - 1)(JV - 2)) M (Scoccimarro et al. 2001). Figures 14a and 14b plot (N(N - l))^ 2 and 

1/3 

(N(N — 1)(N — 2))m j respectively Taking square and cube roots allows direct comparison to the 
mean occupations plotted in Figure 13: if P(N\N avs ) is Poisson distributed, then (N(N — 1)(N — 
2))m = {N{N — 1))^ = (N)m- Figure 14 shows good agreement between the pair and triple 
counts of galaxy and subhalo populations in all cases. In particular, the subhalo counts reproduce a 
key feature of the galaxy counts, namely a distribution that is substantially narrower than a Poisson 
distribution at low occupation numbers. Berlind et al. (2003) and Kravtsov et al. (2004) discuss 
the physics behind these sub-Poisson count distributions: over the range in which (N) rises from 
one to a few, more massive halos tend to host more massive central galaxies or subhalos instead of 
hosting multiple objects above the mass threshold (see also Benson et al. 2000; Zheng et al. 2005). 

In the one-halo regime, galaxy clustering depends on the internal distribution of galaxies within 
halos, in addition to P(N\M). Therefore, agreement in (N(N - l)) M and (N{N - 1)(JV - 2)) M 
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Fig. 13. — The mean number of galaxies (points with error bars) and substructures (dotted line for 
the SPH simulation and dashed line for the DM simulation) in halos of mass M. Filled circles and 
upper lines represent the n = 0.1/i 3 Mpc sample, while open circles and lower lines represent the 
n = 0.02/i 3 Mpc sample. Error bars show the error on the mean, computed from the dispersion 
of TV among all halos in the bin divided by the square-root of the number of halos. The highest 
mass bin contains only a single halo, so no error bar is computed. 
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Fig. 14. — (Left) The square-root of the mean number of pairs per halo, (N(N — 1))^ , in the same 
format as Fig. 13. (Right) The cube-root of the mean number of triples per halo, (N(N — 1)(N — 

1/3 

2))m • If P(N\ (N)m) were Poisson distributed, then both of these quantities would trace the mean 
occupation functions (N)m, but comparison to Fig. 13 shows that the distributions for all three 
populations are substantially narrower than Poisson distributions at low occupation numbers. 
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does not guarantee agreement in two- and three-point correlations on small scales. We now turn 
to a direct investigation of small scale clustering as measured by the two-point correlation function 
and moments of counts-in-cells. 



4.5. f (r) and S n 

Because of the small simulation volume, the calculated galaxy clustering statistics are not 
good estimates of the global predictions for this cosmological model. However, since the DM and 
SPH simulations started from identical initial conditions, we can carry out differential comparisons 
between the clustering of SPH galaxies, SPH subhalos, and DM subhalos, and we expect the 
differences to be indicative of those that would arise in larger volumes. 

Figure 15 shows the two-point correlation functions of SPH galaxies and subhalos, for samples 
with a space density of 0.1/i 3 Mpc (left panel) and 0.02/i 3 Mpc (right panel). The strong curva- 
ture in the range r ~ 0.1 — 2/i~ 1 Mpc, especially evident for the 0.1/t 3 Mpc sample, is produced by 
the single large halo; if we eliminate all members of this halo before computing £(r), then the corre- 
lation functions have an approximately power-law form. At r > 0.5fo _1 Mpc, results for the galaxies, 
SPH subhalos, and DM subhalos converge, as expected based on the similarity of halo occupations 
shown in §4.4. At smaller scales, the two-point function of subhalos is depressed relative to that of 
galaxies, more strongly for the subhalos of the DM simulation, and more strongly for the sample 
with lower mass threshold (n = 0.1/i 3 Mpc ). This departure is primarily caused by the depletion 
of substructures in the densest regions of the largest halo, as seen in Figure 4. If we omit this halo, 
then the correlation functions of galaxies and subhalos track each other down tor ~ 0.2/j _1 Mpc, 
and only the n = 0.1/i 3 Mpc sample shows a clear flattening of the subhalo correlation functions 
at smaller scales. Colin et al. (1999) and Kravtsov et al. (2004) find subhalo correlation functions 
that retain an approximately power-law form down to r ~ 0.05/i _1 Mpc. Since depletion of sub- 
structure is important mainly in the largest halos, the difference between their results and those 
shown in Figure 15 can be explained in large part by the anomalously large contribution that the 
largest halo in our simulation volume makes to £,(r). Furthermore, the depletion of subhalos in our 
n = 0.1/i 3 Mpc sample could be artificially enhanced by numerical effects, since the subhalo mass 
thresholds for this sample correspond to only 53 and 23 dark matter particles in the SPH and DM 
simulations, respectively. 

As a simple measure of higher order correlations, Figure 16 shows the hierarchical moment 
ratios S3, S4, and S$ in spherical cells, estimated from shot-noise corrected moments of the galaxy 
and subhalo count distributions in 2048 3 cells. Here S n = (5™ )/(<5 2 ) n_1 , where (5™) is the n-th 
connected moment of the density field and (5 2 ) is the variance, both corrected for shot-noise (see, 
e.g., Szapudi & Szalay 1993). As with the two-point correlation function, the hierarchical moment 
ratios of galaxies and subhalos converge at scales R > l/j -1 Mpc, but they differ substantially at 
smaller scales, with SPH substructures slightly depressed and DM substructures more substantially 
depressed. Statistics for the n = 0.02/i 3 Mpc sample are poor, but for the n = 0.1/i 3 Mpc 
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Fig. 15. — Two-point correlation function of SPH galaxies (solid line), SPH subhalos (dotted 
line), and DM subhalos (dashed line). The left hand panel shows samples with a space density 
of 0.1/i 3 Mpc -3 and the right hand panel shows samples with higher mass thresholds and a space 
density of 0.02/i 3 Mpc~ 3 . 



O 



SPH galaxies 
substructures (SPH)- 
substructures (DM) 




2.5 



1.5 



o 



0.5 - 



T | I 

SPH galaxies 

substructures (SPH) 

-- substructures (DM) 




R/h" 1 Mpc 
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with a space density of 0.1/i 3 Mpc , and the right hand panel shows samples with higher mass 
thresholds and a space density of 0.02/i 3 Mpc . 
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sample, the depletion of substructure in the densest regions clearly has an increasing impact for 
higher moments of the density field, as one might expect. The convergence scale is larger for 
moment ratios (and for the variance of counts-in-cells) than for £(r) because the moments in cells 
of radius R are affected by structure on all scales smaller than R. 



5. Discussion 

Our comparison of matched SPH and N-body simulations illuminates several aspects of the 
baryon-dark matter "conversation." For the most part, this conversation is one sided: the dark 
matter talks, and the baryons listen. If one traces the history of a halo back in time (Figure 2), 
then the population of the progenitor halos, where gas condenses into galaxies, is nearly identical in 
simulations with and without gas. At z = 0, the positions and masses of the larger subhalos in each 
halo are similar between the two simulations. Smaller subhalos have different positions, presumably 
because small differences in the halo potential can modify their orbits, but in many cases even these 
lower mass subhalos can be matched based on particle memberships. Some SPH subhalos are tidally 
stripped or disrupted in the DM simulation. However, the mass function of subhalos is similar in 
the two cases, with baryonic effects producing only a modest enhancement, primarily for low mass 
subhalos in high mass halos. The dissipative baryon component does increase the internal density 
of subhalos, and while this has little effect on their masses in typical environments, it does enhance 
the survival of subhalos in the densest regions of massive halos. 

These results are not especially surprising, but they are certainly at the simple end of what 
might have been expected. Our most remarkable result is the success of a simple mass thresholding 
scheme in identifying subhalo populations that have nearly identical halo occupation statistics 
to SPH galaxy populations with the same mean space density. To some degree, the number of 
subhalos traces the variation of galaxy number in halos of fixed mass; in particular, halos with 
two significant subhalos are more likely to host two significant galaxies. For each galaxy mass 
threshold, the average mass of host subhalos is approximately five times the average baryonic mass 
of the galaxies themselves (of course, the value of this ratio is likely to depend on the adopted 
cosmological parameters) . 

Our results have encouraging implications for efforts to model galaxy clustering with the sub- 
halo populations of high resolution, dissipationless simulations (e.g., Colin et al. 1999; Kravtsov 
& Klypin 1999; Conroy et al. 2006) and to develop semi-empirical models of galaxy bias by mon- 
tonically matching galaxy luminosity functions to subhalo mass functions (Vale & Ostriker 2004, 
2005). The agreement in halo occupation statistics implies that SPH galaxies and dark matter 
subhalos should have similar large scale clustering statistics, and we indeed find good agreement 
in the two-point correlation function and the hierarchical moments S3, 64, and £5 on scales larger 
than ~ l/i _1 Mpc. However, we find significant depletion of dark matter substructure in the densest 
regions of our one cluster mass halo, in agreement with results from other groups (Diemand et al. 
2004; Gao et al. 2004; Nagai &: Kravtsov 2005). This depletion significantly affects clustering on 
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scales < l/j -1 Mpc, with galaxy clustering stronger than SPH subhalo clustering, which in turn is 
stronger than DM subhalo clustering. The impact on these global statistics might be exaggerated in 
our simulations by the dominance of the largest halo in our small volume, and higher mass resolu- 
tion might reduce the depletion effect itself to some degree. Nagai k Kravtsov (2005) also note that 
subhalo and galaxy density profiles are more similar if one selects subhalos based on the mass they 
have when they are accreted onto the main halo, rather than the final mass, which is preferentially 
reduced by tidal stripping in the inner regions. Conroy et al. (2006) show that selecting subhalos 
based on accreted mass yields good agreement with observed galaxy clustering over a wide range 
of redshifts and luminosities. Overall, the present day distribution of dark matter gives one a good 
idea of where to place galaxies, and the relation between subhalo mass and galaxy baryon mass is 
roughly monotonic even if one uses the final subhalo mass. Simple recipes for matching galaxy and 
subhalo populations fail in the densest environments, but the large scale clustering of galaxies is 
determined mainly by the gravitational dynamics of dark matter. 

We thank Andrey Kravtsov for informative discussions on the topic of subhalo clustering. This 
research was supported by NASA grant NAGS-13308, NSF grant AST0407125, and the French 
CNRS. DHW thanks the Institut d'Astrophysique de Paris for generous hospitality during the key 
phases of this work. 
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